---
redirect_from:
  - /deployment/cloud/auto-suspension
---

# Auto-suspension

<SuccessBox>

Auto-suspension is available in Cube Cloud on
[Starter and above](https://cube.dev/pricing) tiers.

</SuccessBox>

Cube Cloud can automatically suspend deployments when not in use to reduce
[resource consumption][ref-deployment-pricing], which helps manage your spend.

<WarningBox>

Auto-suspension is useful for deployments that are not used 24/7, such as
staging deployments. However, **auto-suspension shall not be used for production
deployments**. See [effects on experience][self-effects] for details.

</WarningBox>

<InfoBox>

Auto-suspension is not avaiable for [production multi-clusters][ref-prod-multi-cluster].

</InfoBox>

Auto-suspension will hibernate the deployment when **no** API
requests are received after a period of time, and automatically resume the
deployment when API requests start coming in again:

<Diagram
  alt="Cube Cloud auto-suspend flowchart"
  src="https://ucarecdn.com/e9a22d59-e0af-40c5-b590-02f2566663d1/"
/>

[Development instances][ref-deployment-dev-instance] are auto-suspended
automatically when not in use for 30 minutes, whereas [production
clusters][ref-deployment-prod-cluster] can auto-suspend after no API
requests were received within a configurable time period.

During auto-suspension, resources are monitored in 5 minute intervals. This
means that if a deployment was suspended 4 minutes ago, and a request comes in,
the deployment will resume immediately and 5 minute of CCU usage will be billed.

## Effects on experience

If auto-suspension is enabled, the behavior of your Cube Cloud deployment will
experience some notable changes.

When a deployment is auto-suspended:

- [Data model][ref-data-model] compilation artifacts are discarded since the
API instances are de-provisioned.
- [Refresh worker][ref-refresh-worker] is suspended, which stops pre-aggregation
builds and prevents the pre-aggregations from being kept up-to-date.
- [Semantic Layer Sync][ref-sls] is suspended, which prevents scheduled syncs
from running.
- [Monitoring integrations][ref-monitoring] are also suspended, which prevents
the export of metrics and logs.

When a deployment is resumed from auto-suspension:

- [Data model][ref-data-model] compilation would need to be done from scratch.
It applies to all tenants in case [multitenancy][ref-multitenancy] is set up.
Consequently, one or more queries served after a deployment is resumed from
auto-suspension are likely to have suboptimal performance.
- [Refresh worker][ref-refresh-worker] would need to refresh all
pre-aggregations that became stale during the suspension, competing for the
query queue with API instances and compromising the end-user experience.

## Configuration

To enable auto-suspension, navigate to <Btn>Settings → Configuration</Btn>
of your Cube Cloud deployment and ensure that <Btn>Enable auto-suspend</Btn>
is turned on:

<Screenshot
  highlight="inset(81% 30% 1% 34% round 10px)"
  src="https://ucarecdn.com/b0a3f38d-6631-47a8-b952-45747cf5255c/"
/>

To configure how long Cube Cloud should wait before suspending
the deployment, adjust <Btn>Auto-suspend threshold</Btn>. For best experience,
it's not recommended to choose anything below 1 hour.

The deployment will temporarily become unavailable for reconfiguration; this
usually takes less than a minute.

## Resuming a suspended deployment

To resume a suspended deployment, send a query to Cube using the API or by
navigating to the deployment in Cube Cloud.

<WarningBox>

Currently, Cube Cloud's auto-suspension feature cannot guarantee a 100% resume
rate on the first query or a specific time frame for resume. While in most
cases, deployment resumes within several seconds of the first query, there is
still a possibility that it may take longer to resume your deployment. This can
potentially lead to an error response code for the initial query.

</WarningBox>

Deployments typically resume in under 30 seconds, but can take significantly
longer in certain situations depending on two major factors:

- **Data model:** How many cubes and views are defined.
- **Query complexity:** How complicated the queries being sent to the API are

Complex data models take more time to compile, and complex queries can cause
response times to be significantly longer than usual.

[ref-caching-preaggs-gs]: /product/caching/getting-started-pre-aggregations
[ref-deployment-dev-instance]:
  /product/deployment/cloud/deployment-types#development-instance
[ref-deployment-prod-cluster]:
  /product/deployment/cloud/deployment-types#production-cluster
[ref-prod-multi-cluster]: /product/deployment/cloud/deployment-types#production-multi-cluster
[ref-deployment-pricing]: /product/deployment/cloud/pricing
[ref-workspace-dev-api]: /product/workspace/dev-mode
[ref-monitoring]: /product/workspace/monitoring
[ref-data-model]: /product/data-modeling/overview
[ref-multitenancy]: /product/configuration/advanced/multitenancy
[self-effects]: #effects-on-experience
[ref-refresh-worker]: /product/deployment#refresh-worker
[ref-sls]: /product/apis-integrations/semantic-layer-sync#on-schedule